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Frame memory: a storage architecture to support rapid design and implementation of 
efficient databases 

Salvatore T. March, Dennis G. Severance, Michael Wilens 

September 1981 ACM Transactions on Database Systems (TODS), volume 6 issue 3 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
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Full text available: f |pdf(1.43 MB) 



Frame memory is a virtual view of secondary storage that can be implemented with 
reasonable overhead to support database record storage and accessing requirements. 
Frame memory is designed so that its operating characteristics can be easily manipulated 
by either designers or design algorithms, while performance effects of such changes can 
be accurately predicted. Automated design procedures exist to generate and evaluate 
alternative database designs built upon frame memory, and the existenc ... 

Keywords: analytic modeling, database design system, virtual secondary storge 



Scalable statistical bug isolation 

Ben Liblit, Mayur Naik, Alice X. Zheng, Alex Aiken, Michael I. Jordan 
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Full text available: fl? ) pdf(180.44 KB) 



We present a statistical debugging algorithm that isolates bugs in programs containing 
multiple undiagnosed bugs. Earlier statistical algorithms that focus solely on identifying 
predictors that correlate with program failure perform poorly when there are multiple 
bugs. Our new technique separates the effects of different bugs and identifies predictors 
that are associated with individual bugs. These predictors reveal both the circumstances 
under which bugs occur as well as the frequencies of fail ... 

Keywords: bug isolation, feature selection, invariants, random sampling, statistical 
debugging 
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Full text available- ffl pdf(4.40 MB) Additional Information: full citation, abstract, references, citings, index 
^ terms 

This paper describes a system for constructing graphical user interfaces following a two- 
view paradigm: one view contains a textual representation of the interface in a special- 
purpose, "little" language, and the other view contains a direct manipulation, interactive 
editor for the user interface. The user interface can be edited in either view, and the 
changes are reflected in the other view. The language allows dialog boxes to be expressed 
in a simple and natural way, and has a well-defined ma ... 

4 Large-scale resources: The automatic creation of lexical entries for a multilingual MT Q 
system 

David Farwell, Louise Guthrie, Yorick Wilks 

August 1992 Proceedings of the 14th conference on Computational linguistics - 
Volume 2 

Publisher: Association for Computational Linguistics 

Full text available: l jg?) pdf(436.57 KB) Additional Information: full citation , abstract , references , citings 

In this paper, we describe a method of extracting information from an on-line resource for 
the construction of lexical entries for a multi-lingual, interlingual MT system (ULTRA). We 
have been able to automatically generate lexical entries for interlingual concepts 
corresponding to nouns, verbs, adjectives and adverbs. Although several features of these 
entries continue to be supplied manually we have greatly decreased the time required to 
generate each entry and see this as a promising method f ... 

5 WHAT: an XSLT-based infrastructure for the integration of natural language 
processing components 
Ulrich Schafer 

May 2003 Proceedings of the HLT-NAACL 2003 workshop on Software engineering 
and architecture of language technology systems - Volume 8 SEALTS '03 
Publisher: Association for Computational Linguistics 

Full text available: * ^pdf(1 58.89 KB) Additional Information: full citation , abstract , references 

The idea of the Whiteboard project is to integrate deep and shallow natural language 
processing components in order to benefit from their synergy. The project came up with 
the first fully integrated hybrid system consisting of a fast HPSG parser that utilizes 
tokenization, PoS, morphology, lexical, named entity, phrase chunk and (for German) 
topological sentence field analyses from shallow components. This integration increases 
robustness, directs the search space and hence reduces processing ti ... 

6 Design of an interpretive environment for Turing 
Jfa James R. Cordy, T. C. N. Graham 

^ July 1987 ACM SIGPLAN Notices , Papers of the Symposium on Interpreters and 
interpretive techniques SIGPLAN '87, Volume 22 issue 7 
Publisher: ACM Press 

Full text available: ^ pdf(514.77 KB) Additional Information: full citation , abstract , citings , index terms 

This paper presents the design of an interpreter structure for modern programming 
languages such as Turing and Modula II that is modular and highly orthogonal while 
providing maximal flexibility and efficiency in implementation. At the outermost level, the 
structure consists of a front end, responsible for interaction with the user, and a back end, 
responsible for execution. The two are linked by a single database consisting of the 
tokenized statements of the user program. Interfaces between the ... 
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7 A formal protection model of security in centralized, parallel, and distributed systems Q 
0^ Glenn S. Benson, Ian F. Akyildiz, William F. Appelbe 

August 1990 ACM Transactions on Computer Systems (TOCS), volume 8 issue 3 

Publisher: ACM Press 

Full text available* W\o6U2.M MB) Additional Information: full citation , abstract , references , citings , index 

terms , review 

One way to show that a system is not secure is to demonstrate that a mal icious or 
mistake-prone user or program can break security by causing the system to reach a 
nonsecure state. A fundamental aspect of a security model is a proof that validates that 
every state reachable from a secure initial state is secure. A sequential security model 
assumes that every command that acts as a state transition executes sequentially, while 
a concurrent security model assumes that multiple commands execut ... 

Keywords: access control, concurrency control, distributed system security, operating 
system security, protection model 



8 Aspen language specifications U 
lt> Thomas R. Wilcox 

November 1977 ACM SIGPLAN Notices, Volume 12 issue n 

Publisher: ACM Press 

Full text available: ^ pdf(1.38 MB) Additional Information: full citation , abstract , citings 

ASPEN is a "toy" language that has been designed for use in the teaching of compiler 
construction and is therefore a curious amalgam of highly refined language features that 
manage to barely cover the full spectrum of algorithmic languages. One selection 
statement is made to do the work of IF-THEN-ELSE and CASE, but with certain severe 
restrictions on the conditions that may be written, on the other hand, iterations must be 
written as a primitive infinite loop with explicit conditional exits. Onl ... 

9 Optimized System Synthesis of Complex RT Level Building Blocks from Multirate Q 

Dataflow Graphs 

Jens Horstmannshoff, Heinrich Meyr 

November 1999 Proceedings of the 12th international symposium on System 

synthesis 
Publisher: IEEE Computer Society 

Full text available: f& pdf(160.15 KB) 

^jT Additional Information: full citation , abstract , citings 

W Publisher Site 

In order to cope with the ever increasing complexity of todays application specific 
integrated circuits, a building block based design methodology is established. The system 
is composed of high level building blocks of which some are reused from previous designs 
while others might have been created by behavioral synthesis. In data flow oriented 
designs, these blocks usually have complex non-matching interface properties, making it 
necessary to generate additional interfacing and controlling hard ... 

10 Musical information retrieval using melodic surface 
Massimo Melucci, Nicola Orio 

August 1999 Proceedings of the fourth ACM conference on Digital libraries 
Publisher: ACM Press 

Full text available: f £| pdf(674.04 KB) Additional Information: full citation , references , citings , index terms 
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11 A storage and access manager for ill-structured data Q 

Jeffrey Kottemann, Michael Gordon, Jack Stott 
>r August 1991 Communications of the ACM, volume 34 issue 8 

Publisher: ACM Press 

t- .. * ^ -. u. dot ha mdv Additional Information: full citation , abstract , references , index terms . 

Full text available: T?1 pdf(2.Q4 MB) — : 

t- 1- ^ review 

Database management systems are powerful tools for processing large volumes of 
structured, or normalized, data. Much of the data to be stored in computer systems, 
however, differs from normalized data in both its logical uses and the storage structure 
required for its effective management. For instance, Van Rijsbergen (1979) distinguishes 
database retrieval from information retrieval (IR)— the retrieval of references to text— by 
c ... 
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1 Improved search ranking: Optimizing scoring functions and indexes for proximity 

search in type-annotated corpora 
■ Soumen Chakrabarti, Kriti Puniyani, Sujatha Das 
May 2006 Proceedings of the 15th international conference on World Wide Web 

WWW '06 
Publisher: ACM Press 

Full text available: ^ pdf(336.62 KB) Additional Information: full citation , abstract , references , index terms 

We introduce a new, powerful class of text proximity queries: find an instance of a given 
"answer type" (person, place, distance) near "selector" tokens matching given literals or 
satisfying given ground predicates. An example query is type=distance NEAR Hamburg 
Munich. Nearness is defined as a flexible, trainable parameterized aggregation function of 
the selectors, their frequency in the corpus, and their distance from the candidate answer. 
Such queries provide a key data reduction step for inf ... 

Keywords: indexing annotated text 



2 Industrial and practical experience track paper session 2: The infocious web search 




engine: i mproving we b searching throu gh linguistic analysis 
^ Alexandros Ntoulas, Gerald Chao, Junghoo Cho 

May 2005 Special interest tracks and posters of the 14th international conference on 

World Wide Web 
Publisher: ACM Press 

Full text available: ^ pdf(227.88 KB) Additional Information: full citation , abstract , references , index terms 

In this paper we present the I nfocious Web search engi ne [23] . Our goal in creating 
Infocious is to improve the way people find information on the Web by resolving 
ambiguities present in natural language text. This is achieved by performing linguistic 
analysis on the content of the Web pages we index, which is a departure from existing 
Web search engines that return results mainly based on keyword matching. This 
additional step of linguistic processing gives Infocious two main advantages. First, ... 

Keywords: concept extraction, crawling, indexing, information retrieval, language 
analysis, linguistic analysis of web text, natural language processing, part-of-speech 
tagging, phrase identification, web search engine, web searching, word sense 
disambiguation 
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engine: improving web searching through linguistic analysis 
^ Alexandros Ntoulas, Gerald Chao, Junghoo Cho 

May 2005 Special interest tracks and posters of the 14th international conference on 

World Wide Web 
Publisher: ACM Press 

Full text available: l || pdf(227.88 KB) Additional Information: full citation , abstract , references , index terms 

In this paper we present the Infocious Web search engine [23]. Our goal in creating 
Infocious is to improve the way people find information on the Web by resolving 
ambiguities present in natural language text. This is achieved by performing linguistic 
analysis on the content of the Web pages we index, which is a departure from existing 
Web search engines that return results mainly based on keyword matching. This 
additional step of linguistic processing gives Infocious two main advantages. First, ... 

Keywords: concept extraction, crawling, indexing, information retrieval, language 
analysis, linguistic analysis of web text, natural language processing, part-of-speech 
tagging, phrase identification, web search engine, web searching, word sense 
disambiguation 



2 Plagiarism detection across programming languages 
Christian Arwin, S. M. M. Tahaghoghi 

January 2006 Proceedings of the 29th Australasian Computer Science Conference - 

Volume 48 ACSC 06 
Publisher: Australian Computer Society, Inc. 

Full text available: ||) pdf(193.09 KB) Additional Information: full citation , abstract , references , index terms 

Plagiarism is a widespread problem in assessment tasks; in computing courses, students 
often plagiarise source code. For all but the smallest classes, manual detection of such 
plagiarism is impractical, and, while automated tools are available, none has been applied 
to detect inter-lingual plagiarism, where source code is copied from one language to 
another. In this work, we propose a novel approach, XPlag, to detect plagiarism involving 
multiple languages using intermediate program code produce ... 
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1 Emerging threats: Search worms 
Niels Provos, Joe McClain, Ke Wang 

November 2006 Proceedings of the 4th ACM workshop on Recurring malcode WORM 
"06 

Publisher: ACM Press 

Full text available: * g) pdf(537.46 KB) Additional Information: full citation , abstract , references , index terms 

Worms are becoming more virulent at the same time as operating system improvements 
try to contain them. Recent research demonstrates several effective methods to detect 
and prevent randomly scanning worms from spreading [2, 13]. As a result, worm authors 
are looking for new ways to acquire vulnerable targets without relying on randomly 
scanning for them. It is often possible to find vulnerable web servers by sending carefully 
crafted queries to search engines. Search wormsl automate this approach ... 

Keywords: index filtering, internet worms, search, security, signature 
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2 Building searchable collections of enterprise speech data 

James W. Cooper, Mahesh Viswanathan, Donna Byron, Margaret Chan 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital 

libraries 
Publisher: ACM Press 

Full text available: "fg pdf(356.53 KB) Add itiona information: full citation , abstract , references , index terms 

We have applied speech recognition and text-mining technologies to a set of recorded 
outbound marketing calls and analyzed the results. Since speaker-independent speech 
recognition technology results in a significantly lower recognition rate than that found 
when the recognizer is trained for a particular speaker, we applied a number of post- 
processing algorithms to the output of the recognizer to render it suitable for the Textract 
text mining system. We indexed the call transcri ... 

Keywords: document display, search, speech analysis, speech retrieval, text mining 
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The social impact from the World Wide Web cannot be underestimated, but technologies 
used to build the Web are also revolutionizing the sharing of business and government 
information within intranets. In many ways the lessons learned from the Internet carry 
over directly to intranets, but others do not apply. In particular, the social forces that 
guide the development of intranets are quite different, and the determination of a "good 
answer" for intranet search is quite different than on the Int ... 
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Weblogs and message boards provide online forums for discussion that record the voice of 
the public. Woven into this mass of discussion is a wide range of opinion and commentary 
about consumer products. This presents an opportunity for companies to understand and 
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The social impact from the World Wide Web cannot be underestimated, but technologies 
used to build the Web are also revolutionizing the sharing of business and government 
information within intranets. In many ways the lessons learned from the Internet carry 
over directly to intranets, but others do not apply. In particular, the social forces that 
guide the development of intranets are quite different, and the determination of a "good 
answer" for intranet search is quite different than on the Int ... 

Building a distributed full-text index for the web 
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We identify crucial design issues in building a distributed inverted index for a large 
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